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Abstract 

Our model is a generalized linear programming relaxation of a much studied random K-SAT 
problem. Specifically, a set of linear constraints C on K variables is fixed. From a pool of n variables, 
K variables are chosen uniformly at random and a constraint is chosen from C also uniformly at 
random. This procedure is repeated m times independently. We are interested in whether the 
resulting linear programming problem is feasible. We prove that the feasibility property experiences 
a linear phase transition, when n — ► oo and m = cn for a constant c. Namely, there exists a critical 
value c* such that, when c < c* , the problem is feasible or is asymptotically almost feasible, as n — » oo, 
but, when c > c*, the "distance" to feasibility is at least a positive constant independent of n. Our 
result is obtained using the combination of a powerful local weak convergence method developed in 
Aldous |Ald92| . jAldOlj . Aldous and Steele |AS03| . Steele |Ste02j and martingale techniques. 

By exploiting a linear programming duality, our theorem implies the following result in the context 
of sparse random graphs G(n, cn) on n nodes with cn edges, where edges are equipped with randomly 
generated weights. Let A4(n,c) denote maximum weight matching in G(n,cn). We prove that when 
c is a constant and n — > oo, the limit linin^oo M(n, c)/n, exists, with high probability. We further 
extend this result to maximum weight 6-matchings also in G(n,cn). 

1 Introduction 

The primary objective of the present paper is studying randomly generated linear programming prob- 
lems. We are interested in scaling behavior of the corresponding objective value and some phase 
transition properties, as the size of the problem diverges to infinity. Our random linear programming 
problems are generated in a specific way. In particular, our linear programs have a fixed number of 
variables per constraint and the number of variables and constraints diverges to infinity in such a way 
that their ratio stays a constant. 

Our motivation to consider this specific class of random linear programs has several sources. The 
main motivation is recent explosion of interest in random instances of boolean satisfiability (K-SAT) 
problems and ensuing phase transition phenomenon. The main outstanding conjecture in this field states 
that the satisfiability property of random K-SAT problem experiences a linear phase transition as the 
function of the ratio of the number of clauses to the number of variables. Our linear programming 
problem can be viewed as a generalized linear programming relaxation of the integer programming 
formulation of such random K-SAT problem. 
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Tightly related to the K-SAT problem are problems of maximal cardinality cuts, independent sets, 
matchings and other objects, in sparse random graphs G(n, [cn\), which are graphs on n nodes and 
[cn\ edges selected uniformly at random from all the possible edges, and where c > is some fixed 
constant. For future we drop the annoying notation [_-J , assuming that cn is always an integer. It is easy 
to show that all of these objects scale linearly in n. It is conjectured that the size of each such object 
divided by n converges to a constant, independent of n. This convergence is established only for the 
case of maximal matchings using direct methods |KS81| . where the limit can be computed explicitly, 
but is open in other cases. 

The main result of this paper states that the objective value of the random linear programming 
problem we consider, when divided by the number of variables converges with high probability (w.h.p.) 
to a certain limit. As a corollary we prove that, suitably defined, distance to feasibility in the same 
random linear programming problem experiences a linear phase transition, just as conjectured for 
random K-SAT problem. Furthermore, we show that, in a special case, the dual of this random linear 
programming problem is a linear programming relaxation of the maximum cardinality matching and 
more generally 6-matching (defined later) problems in G(n,cn). We show that these relaxations are 
asymptotically tight as the number of nodes n diverges to infinity. As a corollary of our main result, we 
prove that maximum cardinality 6-matching when divided by n converge to a constant. These results 
hold even in the weighted version, where edges are equipped with randomly independently generated 
non-negative weights. 

Our proof technique is a combination of a very powerful local weak convergence method and martin- 
gale techniques. The local weak convergence method was developed in Aldous |2El92 , AldOl , Aldous 
and Steele |AS03| . Steele |Ste02j . The method was specifically used by Aldous for proving the £(2) con- 
jecture for the random assignment problem. It was used in |Ald92| to prove that the expected minimum 
weight matching in a complete bipartite graph converges to a certain constant. Later Aldous proved 

AldOj that this limit is indeed C(2), as conjectured earlier by Mezard and Parisi MP87 . Since then 
the local weak convergence method was used for other problems (see |AS03j for a survey), and seems 
to be a very useful method for proving existence of limits in problems like the ones we described, and 
in some instances also leads to actually computing the limits of interest. By an analogy with the per- 
colation literature, we call these problems existence and computation of scaling limits in large random 
combinatorial structures. Such questions, many of them open, abound in percolation literature. For 
example the existence of limits of crossing probabilities in critical percolation have been established 
in several percolation models like triangular percolation using conformal invariance techniques jSchOlj . 

SWOj, but are still open in the case of other lattices, like rectangular bond and site critical percolation, 
see Langlands LPSA94 j. Whether a local weak convergence is a useful technique for addressing these 
questions seems worth investigation. 

To the extend that we know, our result is the first application of the local weak convergence method 
to establishing phase transitions in random combinatorial structures. In the following section we describe 
in details randomly generated combinatorial problems we mentioned above, describe the existing results 
in the literature and list some outstanding conjectures. In Section |3] we describe our model and state 
our main results. We also give a short summary of the proof steps. Sections El are devoted to the 
proof of our main result. Section |HJ is devoted to the applications to the maximum weight matching 
and 6-matching in sparse random graphs. Section is devoted to conclusions and some open problems. 
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2 Background: random K-SAT, sparse random graphs and scaling 
limits 



2.1 Random K-SAT problem 

A satisfiability or K-SAT problem is a boolean constraint satisfaction problem with a special form. A 
collection of n variables xi,X2, ... ,x n with values in {0, 1} is fixed. A boolean formulae of the form 
C\ AC2 A • • • f\C m is constructed, where each C{ is a disjunctive clause of the form Vxj 2 Vxj 3 V ■ ■ -\/Xi K , 
where exactly K variables are taken from the pool x±, . . . , x n , some with negation, some without. The 
formulae is defined to be satisfiable if an assignment of variables Xi,i = l,...,n to or 1 can be 
constructed such that all the clauses take value 1. The K-SAT problem is one of the most famous 
combinatorial optimization problem, see [PS98 . It is well known that the satisfiability problem is 
solvable in polynomial time for K = 2, and is NP-complete for K > 3. 

Recently we have witnessed an explosion of interest in random instances of the K-SAT problem. 
This was motivated by computer science, artificial intelligence and statistical physics investigations, 
with phase transition phenomena becoming the focus of a particular attention. A random instance of 
a K-SAT problem with m clauses and n variables is obtained by selecting each clause uniformly at 

random from the entire collection of ^-w- (2 K ( !^ )) possible clauses where repetition of variables is 

K 

(is not) allowed. In particular, for each j = 1,2, ... ,m, K variables %i x , Xj 2 , . . . , Xi K or their negations 
are selected uniformly at random from the pool x\ , 1 < i < n to form a clause Cj = y% x V . . .yi k , where 
each yi r = Xi r or Xj r , equiprobably. This is done for all j = l,...,m independently. Whether the 
resulting formulae has a satisfying assignment {x±, . . . , x n } — > {0,1}" becomes a random event with 
respect to this random construction. The main outstanding conjecture for the random K-SAT problem 
is as follows. 

Conjecture 1 For every K > 2 there exists a constant c* K such that a random K-SAT formulae with 
n variables and m = cn clauses is satisfiable when c < c* K and is not satisfiable when c > c* K , w.h.p. as 
n —* 00. In other words, the satisfiability experiences a linear sharp phase transition at m = c* K n. 

That the problem experiences a sharp phase transition is proven by Friedghut Fri99 in a much 
more general context. It is the linearity which is the main outstanding feature of this conjecture. 
The conjecture can be rephrased as follows: there does not exist c\ > C2 and two infinite sequences 
nj; ,nf ,t = 1,2,..., such that instances of K-SAT problem with variables and cin^ clauses 
are satisfiable w.h.p., but instances with n? variables and C2n^ clauses are not satisfiable w.h.p., as 
t — > 00. One of the goals of our paper is to establish an analogue of this conjecture for generalized 
linear programming relaxations of the integer programming formulation (to be described below) of the 
random K-SAT problem. 

Conjecture n is proven for the case K = 2. Specifically, c| = 1 was established by Goerdt |Goe92j . 
Goe96 , Chvatal and Reed fGR92] . Fernandez de la Vega [dlV92j . For higher values of K many pro- 
gressively sharper bounds on c* K (assuming it exists) are established by now. For K = 3 the best known 
upper and lower bounds are 4.506 and 3.42, obtained by Dubois, Boufkhad and Mandler DBM00 , and 
Kaporis, Kirousis and Lalas KKL02 , respectively. It is known that c* K , if exists, approaches asymp- 
totically 2 x (log2 + o(l)) when K is large, [ATa| . See also |AM02h| . |FW02j for the related results. 
Talagrand |Tal01j approached the random K-SAT problem using the methods of statistical physics. 

The interest in random K-SAT problem does not stop at the threshold value c* K . For c > c* K 
(assuming ConjectureHholds), a natural question is what is the maximal number of clauses N(n, m) < m 
that can be satisfied by a single assignment of the n variables? It is shown in Coppersmith et al CGHS03 
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that for K = 2 and every c > 1, there exists a constant a(c) > such that N(n,cn) < (c — a(c))n, 
w.h.p. The following conjecture from ( CGHS03 ) then naturally extends Conjecture ^ 

Conjecture 2 Assuming Conjecture Q] holds, linij^oo N ^ n ' cn ^ exists and is smaller than one, for all 
c>c* K . 

In Section |3] we introduce a conjecture similar to the one above with respect to random linear 
programs. 

2.2 Matching and 6-matching in G(n, cn) 

Let G be a simple undirected graph on n nodes {1, 2, . . . , n} = [n] with the edge set E. A set of nodes 
V C [n] in this graph is an independent set if no two nodes in V are connected by an edge. A partition 
of nodes [n] into two k groups V%, V%, . . . , Vk such that Ux<j<fcVi = N> H Vi 2 = for all i\ ^ 12, is 
defined to be a /c-cut. The size of the fc-cut is the total number of edges whose end points belong to 
different sets V%. When k = 2, the fc-cuts are simply referred to as cuts. 

A matching is a collection of edges such that no two edges are incident to the same node. The size 
of the matching is the number of edges in it. A path is a collection of distinct nodes C = {i±, . . . ,ik} 
such that the edges (ii, fa, 13), ■ ■ ■ , (ife-i,ifc) belong to the edge set E. A cycle is a collection of 
distinct nodes C = {h, . . . ,ik} such that the edges (11,12), (*2, ^3), • • • , (ik,h) belong to the 

edge set E. 

Let b > 1 be a positive integer. A ^-matching is a collection of edges A C E such that every node is 
incident to at most b edges from A. Naturally, 1-matching is simply a matching. Note that 2-matching 
is collection of node disjoint paths and cycles. We will also call it path/cycle packing. 

Fix a constant c > 0. Let G(n, cn) denote a simple undirected sparse random graph on n nodes with 
cn edges selected uniformly at random from all the possible n(n — l)/2 edges. This is a standard model 
of a sparse random graph. Denote by IAfT>(n,c),CUT(n,c,k) M(n,c,b) the size (carndinality) of the 
maximum independent set, fe-cut and 6-matching, respectively, in G(n,cn). Suppose, in addition, the 
nodes and the edges of G(n, cn) are equipped with random non-negative weights W™ de , W^ ge drawn in- 
dependently according to some common probability distributions P{VF nodc < t} = w node (t),F{W edge < 
t} = w edge (t). We assume throughout the paper that both W node and W edge have a bounded support 
[0, B w ] (assumed the same for simplicity). Let lMT> w (n,c),CUT w (n,c,k) M. w (n, c, b) denote the max- 
imum weight independent set, /c-cut and 6-matching, respectively, where the weight of an independent 
set is the sum of weights of its nodes, and the weights of a cut and 6-matching are defined as the sums 
of weights of edges in them. 

It is well known and simple to prove that ZNT>(n, c), CUT(n, c, k) M{n, c, 1) are all O(n) w.h.p. as 
n diverges to infinity. For example since a fixed node % is isolated with a positive constant probability, 
then K\XAfT>(n, c)] = @(n). Since any matching is also a 6-matching for b > 1, then M(n,c,b) = 0(n). 
Also the length of the longest path in G(n,cn) is also 0(n), thanks to the result of Frieze [Fri86 . 

It is natural to suspect then that the expected values of these objects divided by n converge to a 
constant, both in the unweighted and weighted cases. In other words, the scaling limits exist for these 
objects. In fact, it is conjectured in Aid] and [DGHS03 , respectively, that the scaling limits 

v E[2ATV(n,c)] E[CUT(n,c)] 
(1) hm — — , hm — — 

n— *oo n ra— >oo n 

exist. The existence of these limits for expectation would also imply almost sure limits, by application 
of Azuma's inequality. 
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The scaling limit of the form (^Q) is in fact proven for maximum cardinality matchings M.{n,c) by 
Karp and Sipser |KS81| . The result was strengthened later by Aronson, Frieze and Pittel |AP F98 . 
Karp and Sipser proved that, almost surely, 



(2) 



lim 



M(n,c) 



= 1 - 



7,(c) + 7*(c) +7*(c)7*(c) 
2 



71 



OG 



n 



where 7*(c) is the smallest root of the equation x = cexp(— cexp(— x)) and 7*(c) = cexp(— 7*(c). Their 
algorithmic method of proof is quite remarkable in its simplicity. We briefly describe the argument 
below and explain why, however, it does not apply to the case of weighted matchings. Suppose we 
are given a (non-random) graph G. Then the following algorithm finds a maximum matching (clearly 
there could be many maximum matchings): while the graph contains any leaves, pick any leaf of a tree 
and the corresponding edge. Mark the edge and delete the two nodes incident to the edge and all the 
other leaves that share the (deleted) parent with the selected leaf. Delete all the edges incident to these 
leaves and delete the edge between the parent and its own parent. Repeat while there are still leaves 
in the graph. When the graph has no more leaves select a maximum size matching in the remaining 
graph. It is a fairly easy exercise to prove that this remaining matching plus the set of marked edges is 
a maximum matching. This fact is used to prove (J2J). 

One notes, however, that when edges of the graph are equipped with some weights, the Karp-Sipser 
algorithm does not necessarily work anymore. Occasionally it might be better to include an edge 
between a parent of a leaf and and a parent of a parent of a leaf and, as a result, not include the edge 
incident to the leaf. Therefore, the Karp-Sipser algorithm may produce a strictly suboptimal matching 
and the results ((21) do not hold for the weighted case. Moreover, it is not clear how to extend the 
Karp-Sipser heuristic to 6-matchings. In this paper we prove the convergence © for the maximum 
weight 6-matchings. The proof uses the main result of the paper and the linear programming duality, 
though we are not able to compute the limits. Naturally, our result applies to the non- weighted case - 
maximum cardinality 6-matching. To the best of our knowledge this is a new result. 

The case of maximum weight matching with random weights is treated by Aldous and Steele AS03 
for the case of a randomly generated tree on n nodes. That is, consider a tree selected uniformly at 
random from the set of all possible n n ~ 2 labelled trees. The limit of the sort © is proven and computed 
using the local weak convergence method, when the edges of this tree are equipped with exponentially 
distributed random weights. The tree structure of the underlying graph helps very much the analysis. 
In our case, however, the random graph G(n,cn) contains a linear size non-tree "giant" component, 
JLROnj, when c > 1/2, and the results of ( |AS05j ) are not applicable. 

Yet another scaling limit question is the existence of the limits for probability of A;-colorability in 
G(n,cn). A graph is defined to be k > 2 colorable if there exists a function mapping vertices of G 
to colors 1,2, ... ,k such that no two nodes connected by an edge have the same color. The following 
conjecture proposed by Erdos is found in Alon and Spencer AS92 . 

Conjecture 3 For every positive integer k > 2 there exists a critical value c* k such that the graph 
G(n,cn) is w.h.p. k-colorable for c < c* k and w.h.p. not k-colorable for c > c* k . 

This conjecture is very similar in spirit to Conjecture ^ For a survey of existing results see a recent 
Molloy's survey jMolOlj . For related results see also |AM02aj . COMS03j. 

3 Model and the main results 

There is a natural way to describe a K-SAT problem as an integer programming problem. The variables 
are X{,%= 1,2, ... , n which take values in {0, 1}. Each clause Cj is replaced by a linear constraint of the 
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form Xi x + {1— Xi 2 )+Xi 3 + . . . > 1, where term (1— x) replaces x. For example a clause C = X3 VX7VX2 VX4 
in a 4-SAT problem is replaced by a constraint £3 + X7 + (1 — X2) + (1 — X4) > 1. It is easy to check that 
an assignment of X2, X3, X4, x-j to and 1 gives C value 1 if and only if the corresponding constraint 
is satisfied. Clearly, these constraints can be created for all the possible clauses. In the present paper 
we study the linear programming (LP) relaxation of this integer programming problem, where the 
restriction Xi E {0, 1} is replaced by a weaker restriction Xi £ [0, 1]. Note, that this relaxation by itself 
is not interesting, as the assignment Xi = 1/2 for all % = 1, 2, . . . , n makes all of the linear constraints 
feasible. However, the problem becomes non-trivial when we generalize the types of constraints that 
can be generated on the variables Xj, and this is described in the following subsection. 

3.1 Random K-LSAT problem 

Our setting is as follows. Consider a fixed collection of K variables yi, JJ2, ■ ■ ■ , Uk which take values in 
some bounded interval B\ < yi < B% and a fixed collection C of linear constraints on these variables: 
]Cf=i a rkUk < Wi r = 1) 2, . . . , [C|, where the values a r fc, b r are arbitrary fixed reals. The r-th constraint 
can also be written in a vector form a r y < 6 r , where a r = (a r ±, . . . , a r x) and y = (y\, . . . , yx)- We fix 
c > and let m = cn, where n is a large integer. A random instance of a linear constraint satisfaction 
problem with n + m variables x%, . . . , x n , tpi, . . . , ip m and m constraints is constructed as follows. For 
each j = 1, 2, ...,m we perform the following operation independently. We first select K variables 
, Xi 2 , . . . , Xi K uniformly at random from Xi,i = 1, 2, . . . , n. Whether the variables are selected with 
or without replacement turns out to be irrelevant to the results of this paper, as it is the case for 
random K-SAT problem. However, the order with which the variables are selected is relevant, since the 
constraints are not necessarily symmetric. Then we select 1 < r < \C\ also uniformly at random. We 
then generate a constraint 

K 

(3) Cj : ^2 a rkX ik <b r + ipj. 

k=l 

Here is an example of an instance with K = 3,n = 10, m = 4, \C\ = 2. Say the first constraint C\ is 
2yi + 3i/2 — < 5, and the second constraint C2 is — y\ + 2/2 + 4^3 < 2. An example of an instance 
where first three constraints are type C\ and the fourth is type C2 is 

(2x5 + 3x4 — xg < 5 + ipi) A (2xi + 3x3 — X4 < 5 + 
(2x 2 + 3xi - xio < 5 + ^3) A (-x 5 + x 8 + 4x 7 < 2 + ^0- 

The central question is what are the optimal values of B\ < Xj < B^ipj ^ 0, which minimize the 
sum ifrj subject to the constraints Cj. That is, we consider the following linear programming problem: 

(4) Minimize ifij, subject to : C\, C2, ■ ■ ■ , C m , Xj € [B^., B^,],^j > 0. 

l<j<m 

In words, we are seeking a solution Xj which is as close to satisfying the constraints Ylk=l a rik x ik — 
as possible. If the optimal value of this linear programming problem is zero, that is tfij = for all j, 
then all of these constraints can be satisfied. Naturally, the objective value of the linear program Q 
is a random variable. We denote this random variable by CV(n,c). Note, that the linear program Q 
is always feasible, by making t/jj sufficiently large. In fact, clearly, in the optimal solution we must 
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have ijjj = max(0, Y2k=i a ri k x i k ~ W)- We refer to the linear program (jlj as a random linear constraint 
satisfaction (LSAT) problem, or random K-LSAT problem. 

The following conditions on the set of constraints C will be used below. 

• Condition A. For any constraint a r y < b r ,l < r < \C\ for any k < K and for any value 
z e [B2,B%] there exist values yi,...,yx € [B^-B^] such that = z and the constraint is 
satisfied. 

• Condition B. There exist a positive integer I and a constant v > such that for any K- 
dimensional cube / of the form ni<fe<_R'[^f > ^tHj-^z < y- < integer, there exists at least 
one constraint Yl a rkUk < &r from C such that for every y £ I , ^ Orfeyfc — 6 r > ^. That is, every 
point of the cube I deviates from satisfying this constraint by at least v. 

The analogue of the Condition A clearly holds for random K-SAT problem. Given any clause 
2/1 V 2/2 V • • • V yx and k < K, if y^ is set to be or 1, we still can satisfy the clause, by satisfying 
any other variable. The following is an example of an LSAT problem where Conditions A and B are 
satisfied. Fix K = 3. Let B^. = 0,B%. = 1, and let C be a collection of all eight constraints of the 

type -yi - 2/2 - 2/3 < -7/ 4 > -(1 - yi) - V2 - V3 < -7/ 4 > ■ • • > ~(1 - Vl) ~ (1 - 2/2) - (1 - 2/3) < -7/ 4 - 
Condition „4 is checked trivially. We claim that Condition B holds for I = 2 and v = 1/4. Select any 
cube I with side-length l/l = 1/2. For example I = [0,1/2] x [1/2,1] x [1/2,1]. Consider constraint 
-Vi-(1-Ite)-(1-Jfe) < -V 4 - For any y G / wehave -yi-(l-y 2 )-(l-y 3 ) > -7/4+1/4 = -7/4+ 1/. 
Other cases are analyzed similarly. 

Consider now the following generalization of the linear program l|4"j). For each j = 1, 2, . . . , m generate 
a random variable Wj, independently from some common distribution P{Wj < t} with a bounded 
support [— B W ,B W ]. Let u> x > and > be fixed non-negative constants. Our random linear 
program in variables Xi,ipj is constructed exactly as above except each constraint Cj : Yli<r<K a rkXi k < 
b r + ipj is replaced by 

(5) Cj : ^2 a rkXi k <K + Wj + tpj, 

l<r<K 

and the objective function is replaced by 



(6) Minimize w x £i<i<„ x { + Wi> Ei<;< m *l>j » 

subject to : Ci, C2, . . . , C m , G [S^, B%], tpj ^ 0- 

This particular form of the linear program might look unnatural at first. But note that setting 
B w = w x = 0, w^p = 1, turns this into exactly linear program (|4*|). We will show later that this general 
format is useful when we study 6-matchings in sparse random graphs G(n, cn). We denote the optimal 
value of the linear program @ by QCV{n,c). As before, this linear program is always feasible, by 
making tpj sufficiently large. Since we assumed > 0, then in the optimal solution 

(7) Vj = max(0, a rk x ik -b r -Wj). 

We now state the main result of this paper. In words, our result asserts that the scaling limit of 
QCV(n,c)/n exists. 



7 



Theorem 1 For every c > 0, the limit 

(8) Hm QL ™ ^ /(c) 

exists w.h.p. That is, there exists /(c) > such that for every e > ; 

S^c) 
n 

as n — > oo. 

Our first application of Theorem ^ is the following result. It establishes a linear phase transition 
property for the random K-LSAT problem. Recall that CV(n, c) is the optimal value of the linear 
programming problem (J3J. 

Theorem 2 There exists a constant c* K > suc/i £/iai, w.h.p. as n — > oo, 

(9) hm = 0, 
/or aZZ c < c|^, and 

(10) liminf £P(n,C) > 0, 

for all c > c* K . Moreover, if Condition A holds, then c* K > 0, and if Condition B holds, then c* K < +oo. 

In what sense does the theorem above establish a linear phase transition? It is conceivable that 
for a collection of constraints C, the following situation occurs: there exist two constants c\ > C2 
and two sequences n^\ni\t = 0,1,2,..., such that for c = c\ the corresponding optimal val- 
ues (@J of the random K-LSAT problem satisfy w.h.p. lim^ jCP(n^ , c)/n^ = 0, but for c = C2, 

(2) (2) 

liminff CV{n\ ,c)/n\ > 5(c) > 0. In other words, the critical density c oscillates between different 
values. This is precisely the behavior that Conjectures ^ and [21 rule out for random K-SAT problem. 
Our theorem states that such a thing is impossible for the random K-LSAT problem. There exists a 
linear function c* K n such that, w.h.p., below this function the instance is very close to being feasible, 
but above this function the scaled "distance" min(l/n) ^ tpj to feasibility is at least a positive constant. 

The statement of the theorem above does not fully match its analogue, Conjecture ^ as i using the 
auxiliary variables ipj we converted the feasibility problem to the optimality problem. Now consider the 
collection of constraints Cj where tjjj are set to be zero, and we ask the question whether the collection 
of constraints in (j3J) has a feasible solution. We suspect that this problem does experience a linear phase 
transition, but we do not have a proof at the present time. 

Conjecture 4 Let c* K be the value introduced in Theorem® Then, w.h.p. as n — > oo, the random 
K-LSAT problem with cn constraints is satisfiable if c < c* K and is not satisfiable if c> c* K . 

In this paper we use local weak convergence method to prove Theorem^ While our approach is 
very much similar to the one used in |Ald92j . there are several distinctive features of our problem. In 
particular, we do not use an infinite tree construction and instead consider a sequence of finite depth 
trees with some corresponding sequence of probability measures. Then we use a Martingale Convergence 
Theorem for the "projection" step. This simplifies the proofs significantly. 
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3.2 Maximum weighted ^-matching 



We return to the setting of Subsection 12,21 We have a sparse random graph G(n,cn), where c is a 
positive constant. The edges of these graph are equipped with random weights Wi j which are selected 
independently from a common distribution P{Wjj < t} = w edge (t),0 < t < B w < oo, where [0, 5 W ] 
is the support of this distribution. Again let Ai w (n, c, b) denote the maximum weight 6-matching in 
G(n,cn), where b > 1 is an integer. 

Theorem 3 For every c > the limit 

(11) lim M ^ b ) s g{c) 

exists w.h.p. 

The probability in the statement of the theorem is both with respect to the randomness of G(n, cn) 
and with respect to the random weights. This theorem is proven in Section|HJ We use linear programming 
duality and certain linear programming formulation of the maximum weight 6-matching problem in order 
to related it to our main result, Theorem ^ 



3.3 Proof plan 

Below we outline the main steps in proving our main result, Theorem Q Let E[-] denote the expectation 
operator. The general scheme of the proof follows the one from Aldous |Ald92| . 

1. We first observe that, as in the case of a random K-SAT problem, in the limit as n — > oo, the 
(random) number of constraints containing a fixed variable x from the pool x\, . . . , x n is distributed 
as a Poisson random variable with parameter cK, denoted henceforth as Pois(c-KT). 

2. For every c > we introduce 

E[gCV(n,c)] 



(12) A(c) = liminf- 



Our goal is to show that in fact convergence lim ra E l0CV{n,c)] an( j th ere f re we can set 

/(c) = A(c). The convergence w.h.p. will be a simple consequence of Azuma's inequality. Then, 
in order to prove Theorem[2J we prove that c* K = sup{c : f(c) = 0} satisfies the properties required 
by the theorem. 

3. We consider a subsequence ni, nz, . . . , Hi, . . . along which E ^ £ ^"" c ^ converges to A(c). Let 
Xi,..., X ni , . . . , ^cn t £ [Bx, B^.] ni x [0, oo) cni denote a (random) optimal assignment which 
achieves the optimal value QCfPijii). For each rii we pick a variable x\ from the pool x±, • • • , x ni 
(the actual index is irrelevant) and consider its depth d neighborhood appropriately defined, where 
d is some fixed constant. We then consider the optimal solution (X(rii, d), ^(rii, d)) restricted to 
this (i-neighborhood. We consider the probability distribution V(d,rii) which describes the joint 
probability distribution for the values of (Xi, fyj, Wj) for Xi, tyj in the (i-neighborhood as well as 
the graph-theoretic structure of this neighborhood. 

We show that for each fixed d, the sequence of probability measures V(d,rii) is tight in its corre- 
sponding probability space. As a result, there exists subsequence of rii along which the probability 
distribution V(d,rii) converges to a limiting probability distribution V(d) for every fixed d. More- 
over, we show that the subsequence can be selected in such a way that the resulting probability 
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distributions are consistent. Namely, for every d < d 1 , the marginal distribution of V(d!) in d- 
neighborhood is exactly V(d). We will show that, since the sequence n« was selected to achieve 
the optimal value K[QCV(rii,c)] ~ X(c)n, then 



where the expectation is with respect to V(d) and the summation is over all the constraints Cj 
containing X\. 

The sequence of probability distributions V(d),d = 1,2,... was used by Aldous in Ald92j to 
obtain an invariant (with respect to a certain pivot operator) probability distribution on some 
infinite tree. Our proof does not require the analysis of such a tree, although similar invariant 
measure can be constructed. 

4. We consider a random sequence E[Xl d = 1,2,... , where X\ G [B^., B%] is, as above, the value 
that is assigned to the variable x\ by an optimal solution, and 3^ is the filtration corresponding to 
the sequence of probability measures V(d), d = 1, 2, . . . . We prove that the sequence Epfi|Q^], d = 
1,2,... is a martingale. 

5. This is the "projection" step in which for any e > and an arbitrary large n we construct a 
feasible solution to the system of constraints © which achieves the expected objective value 
at most (A(c) + e)n. Given any large n and an instance of a random linear program © with 
variables xi, X2, . . . , x n , ipi, . . . , ip cn and constraints Cj, 1 < j < cn, for each variable Xi, 1 < i < n 
we consider its d-neighborhood, where d is a very large constant. We let the value of Xi be E[Xj|9^] 
where the expectation is conditioned on the observed d-neighborhood of the variable Xi and this 
information is incorporated by filtration 3^. By construction, this value is in [B^,B^.]. Then we 
set tyj to the minimal value which satisfies the j-th constraint for the selected values of Xi, for 
all j = 1,2,..., cn. Using a martingale convergence theorem and property (|13|) we show that for 
a randomly chosen variable Xi, the corresponding value of K[w x Xi + £^ ^fj] is smaller than 
A(c) + e, when n and d are sufficiently large. We sum the expectation above over all Xi and observe 
that each constraint belongs in the sum Y^j of exactly K variables X{. Then the sum of these 
expectations is K[w x ^i<i< n Xi + w^ Yli<j<cn which is exactly the objective function. We use 
this to conclude that the expected value of the objective function is at most (A(c) + e)n. 

4 Poisson trees and some preliminary results 

We begin by showing that in order to prove Theorem^ it suffices to prove the existence of a limit Q for 
the expected value of the optimal cost QCV(n, c). Indeed, note that given an instance of a linear program 
(JHJ), if we change one of the constraints Cj to any other constraint from the pool C and change the value 
of Wj to any other value in [— B W ,B W ], and leave all the other constraints intact, then the optimal 
value QLV{n,c) changes by at most a constant. Using a corollary of Azuma's inequality (see Corollary 
2.27 jJLROOj for the statement and a proof), we obtain that F{\GCV(n, c)/n - E[g£V(n, c)/n]\ > e} 
converges to zero exponentially fast for any e > 0. Then the convergence lim^^oo E[^£"P(n, c)]/n implies 
the convergence of QCP{n, c)/n holds w.h.p. Thus, from now on we concentrate on proving the existence 
of the limit 



(13) 




j 



(14) 



lim 



E[gCV(n, cn)] 



n 
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A random instance of a linear program @ naturally leads to a sparse weighted /f-hypergraph 
structure on a node set x±, . . . ,x n . Specifically, for each constraint Cj, 1 < j < cn we create a K-edge 
(xjj, . . . ,Xi K ,r,Wj), if Cj contains exactly variables x^ , . . . , x^ K in this order, the constraint is type 
r,l < r < \C\ and the corresponding random variable Wj has value Wj. This set of edges completely 
specifies the random instance. We first study the distribution of the number of edges containing a given 
fixed node x = x\, . . . , x n . 

Lemma 4 Given node x from the pool x\, . . . , x n , the number of edges ( constraints Cj) containing x is 
distributed as Pois(c-fT), in the limit as n — > oo. 

Proof : The probability that a given edge does not contain x is 1 — K/n if variables are taken 
without replacement and ((n — l)/n) K = 1 — K/n + o(l/n) if taken with replacement. The probability 

(cn \ 
J {K/n) s {l — K/n) cn ~ s . When s is a constant 

and n — > oo, this converges to ^-jr~ exp(—cK). □ 

We now introduce a notion of a d-neighborhood of a variable x. A collection of constraints 
Cij , Cj 2 , . . . , Ci r , 1 < i j < m from an instance of linear program © is defined to be a chain of length 
r if for all j = 1, ... ,r — 1 the constraints Cj. and Cj- +1 share at least one variable. Fix a variable x 
from the pool Xi, 1 < i < n. We say that a variable x' £ {x\, . . . ,x n } belongs to a d-neighborhood of 
x if x is connected to x' by a chain of length at most d. We say that a constraint Cj belongs to the 
^-neighborhood of x if all of its variables belong to it. The variables Wj and ipj in these constraints 
are also assumed to be a part of this neighborhood. In particular, a 1-neighborhood of x is the set of 
constraints Cj which contain x together with variables in these constraints. If no constraints contain x, 
the 1-neighborhood of x is just {x}. For d > 1, we let B(x, d, n) denote the (i-neighborhood of x, and let 
dB(x, d, n) = B(x, d, n) \ B(x, d — 1, n) denote the boundary of this neighborhood, where dB(x, 0, n) is 
assumed to be 0. Of course B(x, d, n) and dB(x, d, n) are random. In graph-theoretic terms, B(x, d, n) 
is a sub-graph of the original -fT-hypergraph corresponding to nodes with distance at most d from x. 

A chain C\ x , Cj 2 , . . . , Ci r is defined to be a cycle if C^ , Ci 2 , . . . , Cj r _ 1 are distinct and Cj r = C^ . The 
following observation is a standard result from the theory of random graph. A simple proof is provided 
for completeness. 

Lemma 5 Let r,r' be fixed constants. The expected number of cycles of length r is at most (K 2 c) r . 
Moreover, the expected number of variables with distance at most r' from some size-r cycle is at most 
rc r+r K 2r+r . In particular, for constants r, r' the probability that a randomly and uniformly selected 
variable x is at distance at least r' from any size-r cycle is at least 1 — TC + ^ — — . 

Proof : Fix any r variables, say x%,...,x r and let us compute the expected number of cycles 
Ci, . . . , C r such that Cj and Cj+i share variable Xj,j = 1, . . . , r, (where C r +i is identified with Ci). 
Constraint Cj contains variables Xj^\,Xj (xq = x r ). There are at most n K ~ 2 /(K — 2)! choices for 
other variables in Cj. For each such choice, the probability that a constraint consisting of exactly this 
selection of variables is present in the random instance is at most (cn) / \n K / K\) = K\c/n K ~ l . Finally, 
there are at most n r ways to select (with order) r variables x\, . . . , x r . Combining, we obtain that, 
asymptotically, the expected number of length-r cycles is at most 



n K-2 



.{K -2)\"'n K - 1 . 

By Lemma HI for any fixed variable x the expected number of variables with distance at most r' from x 
is asymptotically at most (cK) r . Each size r cycle contains at most rK variables. Therefore, the total 
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expected number of variables with distance at most r' to some size r cycle is at most (K 2 c) T (rK)(cK) r = 

rc r+r' j^2r+r' rj 



Applying Lemmas El we obtain the following proposition, which is well known in the context of 
random graphs |JLR 00 . 

Proposition 1 Given a variable x, the number of constraints in B(x,l,n) is distributed as Pois(c-KT), 
in the limit as n — > oo. Also B(x,d,n) is distributed as a depth-d Poisson tree. That is, if dB(x,d,n) 
contains r constraints and r(K — 1) variables, then the number of constraints in dB(x,d + l,n) is 
distributed as Poi$(crK(K — 1)). Moreover, these constraints do not share variables, other than variables 
in dB(x,d,n). In short, the constraints in B(x,d,n) are distributed as first d steps of a Galton-Watson 
(branching) process with outdegree distribution Pois(cK"), in the limit as n — > oo. 

We finish this section by showing how Theorem ^ implies Theorem |2J We noted before that the 
linear program Q is a special case of the linear program ((Sj). via setting Wj = with probability one 
and by setting w x = 0, uy, = 1. Assuming limit /(c) defined in Theorem ^ exists, let us first show that 
it is a non-decreasing function of c. This is best seen by the following coupling arguments. For any 
c\ < C2 and large n consider two instances of random linear program Q with m = c\n and m = C2n, 
where the second is obtained by adding (c2 — c\)n additional constraints to the first instance (we couple 
two instances). For each realization of two linear programs, such an addition can only increase or leave 
the same the value of the objective function. Note that in both cases we divide the objective value by 
the same n. We conclude f(c\) < /(C2). 

Let c* K = sup{c > : /(c) = 0}. Clearly the set in this definition includes c = 0, and therefore is 
non-empty. The definition of c* K of course includes the possibilities c* K = 0, c* K = 00 and clearly satisfies 
@ and P)). 

The proof of the second part of Theorem |2] follows from the following proposition. 

Proposition 2 Under Condition A, for any c < 1/K 2 , a random K-LSAT instance has the optimal 
value W[£P{n, c)] = 0(1). Under Condition B, there exists ck > such that for all c > ck, there exists 
5(c) > for which E[£V(n,c)] > 5(c)n. 

Proof : Suppose Condition A holds. The technique we use is essentially "set one variable at 
a time" algorithm, used in several papers on random K-SAT problem to establish lower bounds on 
critical values of c, see, for example |AS00| . Set variable x\ to any value in [B*,B%.] (actual value is 
irrelevant). For every constraint containing x\ (if any exist), set other variables in this constraint so that 
this constraints are satisfied with corresponding values of tp equal to zero. This is possible by Condition 
A. For every variable which is set in this step, take all the other constraint containing this variable (if 
any exist) and set its variables so that again these constraints are satisfied with '0 = 0. Continue in 
this fashion. If at any stage some newly considered constraint Cj contains some variable which was set 
in prior stages, set V to the minimum value which guarantees to satisfy this constraint. At the worst 
we put ijj = max r (|6 r j + J2k \ a rk\ \B%\). Note, however, that this situation occurs only if Cj 

belongs to some cycle. Applying the first part of Lemma for c < 1 / K 2 the total expected number 
of constraints which belong to at least one cycle is at most Y^r^i r {K 2 c) r = 0(1). Therefore, the total 
expected number of constraints, for which we set ip positive, is also O(l) and ¥,[CV(n,c)] = 0(1). 

Suppose now Condition B holds. The proof is very similar to proofs of upper bounds on critical 
thresholds for random K-SAT problems and uses the following moment argument. Consider an instance 
of random K-LSAT with n variables m = cn constraints, where c is a very large constant, to be specified 
later. Consider any n-dimensional cube I of the form rii<fc<n[^f > ^F"]; — T < ^x- Consider the 
optimal cost CV(n,c) corresponding to solutions x,ip, such that x is restricted to belong to /. Let, 
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w.l.g., x\, . . . , x 3 K be the variables which belong to the j-th. constraint, 1 < j < m. By Condition B, with 
probability at least 1/|C|, Cj is such that the corresponding ipj must be at least v. We obtain that the 
expected cost E[£"P(n, c)] > v(l/\C\)m, when x G /. Moreover, since the events "Cj is such a constraint" 
are independent for all j, then, applying Chernoff bound, ¥{CVi(n, c) < (l/2)u(l/\C\)m} < e *TcT where 
the optimum CPi{n,c) of the linear program © is taken over the subset x € /. Since there are at 
most l n different cubes /, and every solution x belong to at one of them (several for points x on the 

_ m 

boundary between two cubes) then F{CP(n, c) < {l/2)u{l/\C\)m} < l n e s l c l . By taking m = cn with 
c sufficiently large, we obtain that the probability above is exponentially small in n. □ 

5 Limiting probability distributions 

In this and the following two sections we prove the existence of the limit (|14j) . Fix c > and take n 
to be large. We assume that the labelling (order) of the variables x\,... ,x n is selected independently 
from all the other randomness of the instance. In graph-theoretic sense we have a random labelled 
hypergraph with labels of the nodes independent from the edges of the graph. We noted above that 

(15) ijj< max (J2(\BI\ + \B%\)K\a rk \ + \b r \\ = B f . 

~ ~ ^ k 

Let, X{n),^{n) denote an optimal (random) assignment of the random linear programming problem 
©, where X(n) = {X\, . . . , X n ), ^(n) = . . . , ^ m ). That is xi = Xi,tpj = achieve the objective 
value QCP(n, c). If the set of optimal solutions contains more than one solution (in general it can be 
uncountable) select a solution X(n),^(n) uniformly at random from this set. Define 

n n 

and find a subsequence nt, t = 1, 2, . . . along which 

t n t 

Fix a variable x from the set of all nt x- variables. Since labelling is chosen independent from the instance, 
we can take x = x±, w.l.g. Denote by X(d, n t ), ^(d, nt), W(d, nt) the collection of X, ^ and VF-variables 
which belong to the neighborhood B(x\,d,nt). In particular X\ € X(d,n t ) and the number of ^ 
variables is the same as the number of W variables which is the number of constraints Cj in B{x\, d, nt). 
Denote by Vid,nt) the joint probability distribution of (B(x\, d, n t ), X{d, n t ), ^(d, nt), W(d, nt)). We 
omit x = xi in the notation V(d, nt) since, by symmetry, this joint distribution is independent of the 
choice of x. The support of this probability distribution is X{d) = U(T, [B^, -B^]' T ' x [0, B^] E ^ x 
[-B w , B W ] E ^) U £, where the first union runs over depth-d rooted trees T with root x\, \T\ is number 
of nodes in T, E(T) is the number of constraints in T, and 8 is a singleton event which represents the 
event that B{x\,d,nt) is not a tree and contains a cycle. In particular, X{d) is a countable union of 
compact sets. We have from Proposition ^ that lim^oo V(d, nt)(£) = 0. Observe that X(d) C X{d+ 1) 
for all d. As we showed in Proposition the marginal distribution of T with respect to V(d,nt) is 
depth-d Poisson tree, in the limit as t — > oo. 

Recall, that a sequence of probability measures V n defined on a joint space X is said to be weakly 
converging to a probability measure V if for any event A C X, \\m n ^ 00 'P n {A) = V(A). We also need 
the following definition and a theorem, both can be found in |Dur96j . 
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(16) 



(17) 



Definition 1 A sequence of probability measures V n on X is defined to be tight if for every e > there 
exists a compact set K C X such that limsup n P{X \ K} < e. 

Theorem 6 Given a tight sequence of probability measures V n on X there exists a probability measure 
V on X and a subsequence V nt that weakly converges to V. 

The following proposition is a key result of this section. 

Proposition 3 For each d = 1,2,... there exists a probability measure V(d) on X{d) such that V(d, nt) 
weakly converges to V(d). The sequence of probability measures V(d),d = 1,2,... is consistent in the 
sense that for every d < d' , V(d) is the marginal distribution ofV{d!) on X(d). The probability of the 
event 6 is equal to zero, with respect to V{d). Finally, with respect to V{d), 



where the summation is over all the constraints Cj in T(d) containing the root variable x\ . 

Proof : The proof is similar to the one in |Ald92 for constructing a similar limiting probabil- 
ity distribution of optimal matchings in a complete bi-partite graphs with random weights, where a 
compactness argument plus Kolmogorov's extension theorem is used to obtain the limiting measures 
on infinite tree. In our case, since we limit ourselves to trees with bounded depths, the proofs can be 
simplified by using the tightness argument. Fix d > 1. We claim that the sequence of measures V(d, nt) 
is tight on X(d). By Proposition^ according to the measure V(d, n t ) the neighborhood B(xi, d, nt) ap- 
proaches in distribution a depth-d Poisson tree with parameter cK. In particular, the expected number 
of constraints in this neighborhood is smaller than cK + c 2 K 3 + . . . + c d K 2d ^ 1 = Mo, in the limit t — > oo. 
Fix e > 0. By Markov's inequality the total number of constraints in B(xi,d,nt) is at most M with 
probability at least 1 — e, for M > Mq/e and nt sufficiently large. This implies that, moreover, each 
x-variable in B(xi,d,n t ) belongs to at most M constraints (has degree at most M) with probability at 
least 1 — e. 

Let X(d,M) C X{d) denote U(T, [B l x , B%]\ T \ x [0, A/>] S(T) x [-B W ,B W } E ^), where the trees T are 
restricted to have degree bounded by M. That is, each variable in such a tree belongs to at most M 
constraints - one towards the root and M — 1 in the opposite direction, and the root x belongs to at 
most M constraints. The number of trees T in X(d, M) is finite and, as a result, the set X(d, M) is 
compact, as it is a finite union of sets of the form {T} x [B^B 2 ]^ x [0,B^ E ^ x [— B w , B W ]\ E ^. 
We showed above that according to V(d,n t ), the neighborhood B{x\,d,n t ) belongs to X(d, M) with 
probability at least 1 — e, for all sufficiently large t. This proves that the sequence of measures V(d,nt) 
is tight. Then, applying Theorem El there exists a weakly converging subsequence V(d, n^). We find 
such a sequence for d = 1 and denote it by V(l, np),t = 1, 2, . . . . Again using TheoremEl for d = 2 
there exists a subsequence of V(2,n[^) which is weakly converging. We denote it by P(2,nf ) ). We 
continue this for all d obtaining a chain of sequences np D D ••• D nf^ D ■ • • . Select a diagonal 
subsequence n\ = from these sequences. Then all the convergence above holds for this diagonal 
subsequence. In particular, for every d, V(d, n£) converges to some probability measure V(d). Moreover, 
these measures, by construction, are consistent. Meaning, for every d < d', the marginal distribution of 
■p(cf) onto X(d) is simply V(d). Since, from Proposition^ the probability of the event £ according to 
V(d,n t ) approaches zero, then the probability of the same event with respect to V(d) is just zero, for 
every d. 



(18) 
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To complete the proof of the proposition we need to establish (|18|). Note that in random instances 
of linear program ©, with nt ^-variables, when we sum the expression w x X{ + -jfYLj^j over an 
Xi,i = 1,2, ...,n t we obtain w x Yli<i< nt ^» + ^ Si<j< C n t ^i' smce each variable Vl/j appeared in 
exactly K sums, corresponding to K variables in j-th constraint. From 1)17|) we obtain that lim^ ~E[w x Xi + 
-jf-Ylj^j] = M c )> where the expectation is with respect to measure V(d,n t ). Since all the random 
variables involved in w x Xi + ]TV Vl/j are bounded and since V(d,n^) converges weakly to V(d), then 
the convergence carries on to the expectations. This implies that with respect to V(d), E,[w x Xi + 



6 Filtration and a martingale with respect to V(d) 

Let us summarize the results of the previous section. We considered a sequence of spaces X{1) C X{2) C 
X{3) C ••• , where X(d) = U(T,[B X , B^ x [0,B^ E( ^\ x [-B w , B W ]\ E( - T ^) and the union runs over 
all depth-d trees T. Recall, that the event £ was used before to generically represent the event that 
B(x, d, n) is not a tree. The probability of this event is zero with respect to the limiting probability 
distribution V(d) we constructed, so we drop this event from the space X(d). We have constructed a 
probability measure V(d) on each X{d) as a weak limit of probability measures V(d,rit), defined on 
(i-neighborhoods of a variable x\. Denote by (T(d), X(d),^(d),W(d)) the random vector distributed 
according to "P(d). Note, that T(d) and W{d) are independent from each other, first distributed as a 
depth-d Poisson tree, second distributed as i.i.d. with the distribution function F{W < t}, yet X(d) 
and ^(d) depend on both T(d) and W(d). 

Using Kolmogorov's extension theorem, the probability measures V(d) can be extended to a prob- 
ability measure V on space U(T, [Bl,B 2 x f° x [0,5^]°°) x [— B w , B w ] °°), where the union runs over all 
finite and infinite trees T. This extension is used in Aldous |Ald92j to construct a spatially invariant 
measure on infinite trees. For our purpose this extension, while possible, is not necessary, and instead 
we use a martingale convergence techniques. 

Note, that, since the measures V(d) are consistent, they correspond to a certain filtration on the 
increasing sequence of sets X(d),d = 1, 2, . . . . Then we can look at K[Xi\(T(d), W(d))] as a stochastic 
process indexed by d = 1, 2, . . . . Another way of looking at this stochastic process is as follows. A root 
x = x\ is fixed. At time d = 1 we sample constraints Cj containing x\ with the corresponding values of 
Wj using the 7^(1) distribution. Recall that the number of constraints is distributed as Pois(c-fT), the 
type of each constraint is chosen uniformly at random from \C\ types, and Wj values are selected i.i.d. 
using distribution function ¥{W < t}. For this sample we have a conditional probability distribution 
of (A(l), ^(1)), that is of x and variables in 1-neighborhood of x\. Then, for each variable in this 
depth-1 tree except for x\ we again sample constraints and Wj values to obtain a depth-2 Poisson tree 
T(2) and W{2). We obtain a distribution of (A(2),^(2)) conditioned on T(2), W(2), and so on. On 
every step we reveal a deeper layer of the tree and and the corresponding values of Wj-s to obtain a 
new conditional probability distribution for (X(d), ^f(d)). 

The technical lemma that follows is needed to prove Proposition 0] below. This lemma is sometimes 
defined as tower property of conditional expectations. The proof of it can be found in many books on 
probability and measure theory, see for example Theorem 1.2, Chapter 4 in [pur 96 . 

Lemma 7 Let X and Y be dependent in general random variables, where X € R and Y takes values 
in some general set y. Let f : y — > y be a measurable function. Then M\E[X\Y]\f(Y)] = M[X\f(Y)]. 

The conditional expectations in this lemma are understood as follows. E[X|y] is an expectation 
of X conditioned on the cry-algebra generated by the random variable Y. Similarly, E[X|/(Y)] is an 
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expectation of X conditioned on the c/(y) -algebra generated by the random variable f(Y). Of course 
(Tf(Y) C (Ty. 

As above let x\ be the root of our random tree T(d) and let X\ be the corresponding random 
variable from the vector (T(d), X(d),^/(d),W(d)). Conditioned on the event that x\ belongs to at 
least one constraint Cj, select any such a constraint and let, w.l.g. x 2 , ■ ■ ■ ,xx be the variables in this 
constraint. For every k = 2, . . . ,K and every d > 2, consider B(x k ,d) = T(x k ,d), the d-neighborhood 
of the variable x k . One way to introduce this neighborhood formally is to select any d' > d + 1, sample 
T{d') from V^d') and then consider the d-neighborhood around x k , as a subtree of the tree T{d ! ). But 
since, by consistency, the distribution of these neighborhood is the same for all d' > d+ 1, then we can 
simply speak about selecting T(x k , d). Note, that T(x±,d) = T(d). Let W(x k , d) denote the collection of 
Wj variables corresponding to constraints in T(xk,d). To simplify the notations, we will let T w (x k ,d) 
stand for the pair (T(x k , d), W(x k , d)). 

Proposition 4 For every k = 1, 2, . . . , K a random sequence E[X k \T w (x k , d)], d = 1, 2, . . . is a mar- 
tingale with values in [B^,,B 2 ]. Moreover, the sequence E[X k \T w (x d mod (K),<£)], d = 1,2,... is also a 
martingale. 

Remark : To understand better the meaning of the first part of the proposition, imagine that we 
first sample depth-1 tree T w (l). In case this tree is not trivial (T w (l) ^ {^i}), we fix a constraint from 
this tree and variable x k from this constraint. Then we start revealing trees T w (x k , 2), T w (x k , 3), . . . . 
This defines the stochastic process of interest. 

The second part states that even if we reveal trees rooted in a round-robin fashion at variables 
xi, X2, . . . , xk, xi, X2, ■ ■ ■ , then we still obtain a martingale. That is the sequence E[Xfc|T H/ (xi, 1)], 
E[X k \T w (x 2 , 2)], . . . , E[X k \T w (x K , K)], E[X k \T w (x 1} K + 1)], E[X k \T w (x 2 , K + 2)},..., is a martin- 
gale. 

Proof : Recall, that the optimal values X(d) are a weak limit of optimal values X(d,n^),t = 
1,2,... . Then, every Xi € [B^B^] almost surely. To prove the martingale property we use Lemma 
|7| where X = X kl Y = T w (x k ,d + 1) and / is a projection function which projects a depth-ci + 1 tree 
T w (x k ,d + 1) onto a depth-d tree T w (x k ,d) by truncating the d + 1-st layer. Applying the lemma, 
we obtain E[E[X k \T w (x k , d + 1)]\T W (d)] = E[X k \T w (x k ,d)], which means precisely that the sequence 
is a martingale. The proof of the second part is exactly the same, we just observe that T w (x±, 1) C 
T w (x 2 ,2) C • • • C T w (x d mod ( K ),d) C • • • , and we let / to be a projection of T w (x d+1 mod (K) ,(i+l) 
onto T w (x d mod ( K ),d). □ 

The following is a classical result from the probability theory. 

Theorem 8 [Martingale Convergence Theorem]. Let X n ,n = 1,2,... be a martingale which 
takes values in some finite interval [-B, B\. Then X n converges almost surely to some random variable 
X. As a consequence, \X n+ \ — X n \ converges to zero almost surely. 

A proof of this fact can be found in |Dur96j . Here we provide, for completeness, a very simple proof 
of a weak version of MCT, stating that \X n+ \ — X n \ converges to zero in probability. This is the only 
version we need for this paper. We have 

(19) < E[(X n+1 - X n f] = E[X 2 n+l ) - 2E[X n+1 X n ] + E[X*]. 

But E[X n+ iA" n ] = E[(E[X n+ i|X n ])X n ] = E[A^], where ELY n+ i[A" n ] = X n is used in the second equality. 
Combining with (UU, we obtain < E[(X n+1 - X n ) 2 } < E[Xl +l ] - E[X%\ and therefore E[X%] is an 
increasing subsequence. Since E[X%\ < B 2 , then this sequence is converging and, from (|19j) . E[(A" n+ i — 
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X n ) 2 ] converges to zero. Applying Markov's bound, we obtain that |X n+ i — X n \ converges to zero in 
probability. 

Let J(x\) denote the set of constraints Cj in T(d) containing x\ and let J{x\) = |i7(a;i)|. Select 
again any constraint from J{x\). We denote this constraint by C{x\) and let it be Yli<k<K a rkVk < 
b r + Wj + tpj for some 1 < r < \C\, in an expanded form. Again let X2, x%, . . . ,xk denote the other 
variables in this constraint. Recall that the labelling of variables in this constraint was done consistently 
with the labelling of the tree T, and, as a result, it is independent from the measure V{d). Let 
a : {1, 2, . . . , K} — > {1, 2, . . . , K} specify the matching between the order in the constraint and in the 
tree. That is, our constraint C(x\) is Yl,i<k<K a rk x cr(k) < K + Wj + ipj. From we obtain that, for 
the limiting probability measure V(d), we also have ^ j = max(0, ^2 k a rk X a ^ —b r — Wj) almost surely. 
Let X k = E[X k \T w (x k ,d)} and let 

(20) *j = max(0, a rk X a{k) - b r - Wj). 

l<k<K 

That is ^>j is the optimal value to be assigned to ipj when the variables x k take values X k . Observe 
that, just like X k , tyj is a random variable. Its value is determined by the random tree T(x k , d) and the 
values of Wj, and, importantly, it is different from tyj. The following proposition is the main technical 
result of this section. Jumping ahead, the usefulness of this proposition is that if we assign the value 
X k to each variable x k and take ipj = ^j for each constraint containing x\, then we obtain that the 
corresponding objective value of the linear program Q "per variable" x\ almost achieves value A(c), 
provided d is sufficiently large. We will use this in the following section for the projection step, where 
for a random instance of linear program (jHJ), we assign value Xi for every variable Xi, 1 < i < n and 
obtain an objective value close to A(c)n, for arbitrary large n. 

Proposition 5 For every e > there exists sufficiently large d = d(e) such that E[w x X\ + ^ ^fj] < 
A(c) + e, where the summation is over all the constraints in J{x\). 

Proof : Fix 5 > very small. Fix any constraint Cj € J{x\) and a variable x k in it. We first 
show that for sufficiently large d, ¥{\X k — ~E[X k \T w (x\, d)]\ > 5} < 5. In other words, the expected 
values of X k conditioned on T w (x k ,d) and T w (x\,d) are sufficiently close to each other. For this 
purpose we use the martingale convergence theorem and Proposition 01 Take largest integer t such that 
tK + k < d. Since, by the first part of the proposition, E[X k \T w (x k ,d)], d = 1, 2, ... is a martingale, and 
since d-(tK + k) <K then, applying Theorem \E[X k \T w (x k , d)] - E[X k \T w (x k ,tK + k)]\ becomes 
very small w.h.p. as d becomes large. In particular, 

F{\E[X k \T w (x k ,d)}-E[X k \T w (x k ,tK + k)}\ > 5} < 5, 

for sufficiently large d. By the second part of Proposition 0J E[X k \T w (x d mod ( K ^,d)],d = 1,2,... is a 
also a martingale, and again applying Theorem El we obtain that 

¥{\E[X k \T w (x k ,tK + k)} — E[X k \T w (x\,tK + 1)]| > 5} < 5, 

Finally, again applying the first part of Proposition|l]to the sequence E[A A i|T M/ (xi, d)],d = 1,2,..., and 
using d — {tK + 1) < 2K, we obtain 

¥{\E[X k \T w { Xl ,d)} -E[X k \T w ( Xl ,tK + 1)]| > 5} < 5, 

for sufficiently large d. Combining and using X k = E[X k \T w (x k ,d)], we obtain 

(21) F{\X k -E[X k \T w ( Xl ,d)}\ > 35} < 5, 
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as claimed. 

For every constraint Cj G J{x\) introduce 



(22) i> j = max(0, £ a rk E[X a(k) \T w ( Xl ,d)\ - b r - Wj), 

k 

where a rk x a ^ < b r + Wj + ipj is the expanded form of the constraint Cj. That is ^> j is defined just 
like ^>j, except for conditioning is done on the tree T w (x\, d) instead of T w (x k , d). Applying (J20J), 1)21(1 
and recalling i$>j,tyj G [0, B^], we obtain that for our constraint Cj 

(23) P{|* i - Vj\ > 36 K max \a rk \} < Kb. 

rk 

Recall, that, according the measure V{d), J(x±) is distributed as Pois(ci^). Select a value M(5) > 
sufficiently large, so that F{J(xi) > M(5)} < 5. Conditioning on the event J(x\) < M(5) we obtain 

p{^|%-%| > 36M(6)Kmax\a rk \ J( Xl ) < M(5)\ 
j 

< p(3Cj G J{xi) : \$j - %| > 35K max \a rk \ J{x x ) < M(5)\ 
I rk J 

(24) < Af(<y)p(|*j- - *j| > 35Kmax\a rk \ J{x x ) < M(6)\ 

L rk ) 

M(5)P||^ - %| > max rA . \a rk \} 



~ P{J(xi) < M(£)} 

where the summation is over constraints in J(xi), the last inequality holds provided 5 < 1/K, and j 
in (|24|) corresponds to any fixed constraint in J(x\). Combining with the event J(x\) > M, we obtain 
from above that, without any conditioning, 

P{V|i- - #,-| > 3SM(S)Kmax\a rk \} < M(5){K + 1)5 + 5 < M(S)(K + 2)S. 

— ' rk 

3 

Since ^fj,^fj G [0, B^,], then the bound above implies 

(25) |eE*j]-E[J^*j]| < B^M(5)(K + 2)5 + 35M(5)Kmax\a rk \. 

j j 

Our final step is to relate ^j to the random variables ^fj, where (X(d),^>(d)) are drawn according 
to the probability distribution V(d). The distinction between ^/j and ^j is somewhat subtle and 
we expand on it here. As we observed above, the values of *$>(d) are determined almost surely by 
fyj = max(0,Y^ k a r kX a ^ — b r — Wj), with respect to the measure V{d), when the corresponding 
constraint is ^2 k a rk x a ^ < b r + Wj + ipj. These values of tyj are different, however, from 4>j which 
are obtained by first taking the expectations of X k conditioned on trees T w (xi,d) and then setting ^j 
according to (j22j). Naturally, Wj and ^ j are related to each other. Note, that for every constraint Cj 
almost surely 

(26) * d > a rk X a{k) -b r - Wj, Vj > 0. 
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By taking the conditional expectations and using the linearity of inequalities above, we obtain 

(27) E^-lT^xi^)] >J2 a rkHXa(k)\T W (xi,d)} -br-Wj, E[^-|T w (xi, d)} > 0, 

k 

where we use the trivial equality E[VFj|T w/ (xi, d)] = Wj. From the definition of in 1)221) . we obtain 
then E[^ , j|T w/ (xi, d)] > \3E' j almost surely, with respect to the random variables T w {x\,d). As a result 
K[tyj\T w (x\, d)] > where the summation is over constraints in J(x\). Therefore, 

(28) E[^^]>eE^-]. 

j j 

Recall from the last part of Proposition |3J that with respect to measure V(d), 

j 

Combining this with £S), and using a simple observation E[Xx] = E\E[Xi\T w fa, d)}] = E[Xi], 

we obtain 

E[w x Xi + ^£%] 
j 

< E[w x Xt + ^j] + w i ,B^M(5){K + 2)5 + 35M(5)K max \a rk \ 

K — ' rk 

j 

(29) = A(c) + w^M(6)(K + 2)5 + 35M(5)K max |a rfe |. 

rk 

Recall that J{x\) is distributed as Pois(c-fT) and, in particular, has exponentially decaying tails. There- 
fore, for any e > we can find sufficiently small 5 and the corresponding M{5) such that 5M{5) < e 
and P{J(xi) > M(5)} < 5. All the other values in the right-hand side of the bound (|29j) are constants. 
Therefore, for any e > 0, we can find sufficiently small 5 > such that the right-hand side is at most 
A(c) + e. Choosing d sufficiently large for this 5, we obtain the result. □ 



7 Projection 

In this section we complete the proof of Theorem ^ by proving the existence of the limit (|14|). We use 
the limiting distribution V(d) constructed in Section and "project" it onto any random instance of 
linear program © with n x-variables and ere constraints C\ , . . . , C cn . 

Fix c > and e > and take n to be a large integer. We construct a feasible solution Xi G 
[B~,, B x ],ipj > 0, 1 < i < n, 1 < j < cn as follows. For each variable X{ consider its depth-d neighborhood 
B(xi, d, n), where d = d(e) is taken as in Proposition^ If the B(xi, t, re) is a tree T(xi, d), then we set the 
value of Xi equal to Xi(n) = K[Xi\T w (xi, d)], where the expectation is with respect to the measure V(d). 
That is we observe the depth-d tree T(xt,d) together with values Wj corresponding to the constraints 
Cj in this tree and set the values Xj to be equal to the expectation of Xi condition on this observation. 
If, on the other hand, B(xi,t,n) is not a tree, then we assign any value to x^ = Xi(n), for example 
Xi = B\. Once we have assigned values to X{ in the manner above, for every constraint Cj : a r kXi k < 
b r + Wj +ipj, we set its corresponding value of tpj to ^j(n) = max{0, a r ^Xi h — b r — Wj} - the optimal 
choice for given values of Xj-s. 
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Proposition 6 For every e > 0, the solution constructed above has expected cost at most (A(c) + e)n 
for all sufficiently large n. 



Since e was an arbitrary constant and since A(c) was defined by (|16f) the proposition shows that the 
assignment above satisfies linin^oo ~E[GCV(n, c)]/n = A(c). Therefore the proposition implies Theorem 

12 

Proof : Select one of the n variables Xi uniformly and random. W.l.g. assume it is x\. Fix eo > 
very small. We claim that when n is sufficiently large, we have 

(30) EKli(n) + ^ Yl ^ A W + 3e °' 

j 

where the summation is over all constraints Cj containing x\. First let us show that this implies 
the statement of the proposition. Indeed, multiplying the inequality by n, and recalling that x\ was 
uniformly selected from Xj, 1 < i < n, we obtain that E[w x Y^i<i< n Xi(n)+w^ ^ ■ ^j(n)] < (A(c)+3eo)n, 
where again we used the fact that each variable ^j(n) was counted exactly K times. By taking eo < e/3, 
we obtain the required bound. 

Consider the neighborhood B{x\, d + 1, n) and suppose first that it is not a tree. Denote this event 
by £ n . From the second part of Lemma[51 P{£ n } = 0(l/n), where the notation O(-) involves constants 
d, c and K. Recall, that the values of ^j(n) never exceed B^. Then we have 

EMi(n) + ^^^( n )|q 

3 

< w x max{\B x \, \B%\} + w^B^E[\ J(xi, n)\£ n ] 

/| R i| ir2ii^ r E[|J(x 1; n)l{£- n }] 
= w x max{ | B x \ , \ B x \ } + B^ , 

where l{£ n } is the indicator of an event £ n . Recall from the proof of Lemma 0J that the probability 

(on \ 
)(K/n) s (l — K/n) cn ~ s for large n. It follows that the probability 

that J(x\,n) exceeds log 2 n is at most 1/n 3 for sufficiently large n (more accurate bounds can be 
obtained JLR00 ), which are not required here). Then 

E[J( Xl ,n)l{£ n }] 

< E[J(xi,n)l{f n }l{J(xi,n) < log 2 n}} + E[J(x u n)l{£ n }l{ J(x u n) > log 2 n}] 

(31) <log 2 nP{^ n } + ^P{£: n }, 



where we used the fact that J{x\,n) < cn with probability one. But since P{£ n } = 0(l/n), we obtain 
that E[{w x X!{n) + ^ Ej = o(l). 

Suppose now that B{x\,d+ 1, n) is a tree T(xi,d), that is the event £ n occurs. Select any of the 
constraints containing x\ (if any exist), and let X2, ■ ■ ■ ,xk be the variables in this constraints. Note 
that then B(xf,, d, n) are also trees T(xk, d) and the corresponding values X^n) = K[Xk\T w (x^, d)}, k = 
2,3, ... ,K for these variables are set based only on the observed trees T{xk,d) and values Wj, that 
is exactly as Xf. where defined in the previous section. Then the same correspondence holds between 
\F, (n) and tyj. Applying Proposition and the first part of Proposition [21 that is the fact that 
B(xx,n,d) converges to T(x±,d) distributed according to V(d), we obtain that, for sufficiently large n, 
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E[tw x Xi(n) + ^ Ej < A(c) + 2e , where the second eo comes from approximating 23(xi,n,<i) by 

T(xi, d). Combining with the case of non-tree B(x\,d, n) we obtain 

E[^Xi(n) + ^^%(n)] < A(c) + 2e + o(l) < A(c) + 3e . 
i 

for sufficiently large n, just as required by (|30j) . This concludes the proof of Theorem ^ □ 



8 Applications to maximum weight 6-matchings in sparse random 
graphs 

The main goal of this section is proving Theorem We begin with a linear programming formulation 
of the maximum weight matching problem. Suppose we have (a non-random) graph with n nodes and 
m undirected edges represented as pairs of nodes. Denote by E the edge set of the graph. The 
edges are equipped with (non-random) weights < Wij < w max . Given V C [n], let 5(V) denote 
the set of edges with exactly one end point in V. A classical result from the theory of combinatorial 
optimization ( |Sch03| . Theorem 32.2) states that the following linear programming problem provides an 
exact solution (namely, it is a tight relaxation) of the maximum weight ^-matching problem: 

(32) Maximize WijXij 

hi 

subject to : 

(33) Xij <b, Vi = 1, 2, . . . , n, 

j 

(34) ^x itj + Xij < b \ V \ + ^ A \- 1 j VFC [n],AcS(V) such that + |A| is odd, 

(35) < x i>: j < 1. 

Specifically, there exists an optimal solution of this linear programming problem which is always 
integral and it corresponds to the maximum weight ^-matching. We denote the optimal value of the 
linear program above by £PM.{G). 

Our plan for proving Theorem |31 is as follows. We first show that when the graph has very 
few small cycles (and this will turn out to be the case for G(n,cn)), the optimal value CVA4°(G) 
of the modified linear program, obtained from ()32|) - (|35j) by dropping the constraints (|34[). is very 
close to £PA4(G). In the context of random graphs G(n,cn), this will imply that the difference 
CPA4(G(n, cn)) — CPA4°(G(n, en)) = o(n), w.h.p. We then take the dual of the modified linear pro- 
gram (|32|) . I)33|) . ()35|) and show that it has the form ©. Applying Theorem^we will obtain that the limit 
lim n CPA4°(G(n, cn)/n exists w.h.p. This will imply the existence of the limit lim n £VAi(G(n, cn)/n 
w.h.p. Since CVAd{G{n,cn) is the maximum weight 6-matching in G(n,cn), that is A4 w (n, c, 6), and 
Theorem 01 follows. 

Naturally, CVM°(G) > CVM{G). 

Proposition 7 Given a weighted graph G and d > 3 let L(d) denote the set of cycles of length < d in 
G and let M(d) denote the total number of edges in L{d). Then 

(36) ^—^-(CVM°(G) - M(d)w max ) < CVM{G) < CVM°(G). 
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Proof : We already observed that the right-hand side of (|36j) holds. We now concentrate on the 
left-hand side bound. 

Let x° = (x°)ij be an optimal solution of the linear program (|32|) . (|33|). (|35|) with its the optimal 
value CPAA Q (G). In the graph G we delete all the M(d) edges which contribute to L(d) together 
with the corresponding values of x^ ■. Consider the resulting solution x 1 = (xjj) to the linear program 
corresponding to the reduced weighted graph G , which now does not contain any cycles of length 
less than d. The objective value of the linear program (|32|) . (|33|) . (|35|) corresponding to the solution 
x 1 is at least CVM°(G) - M(d)w maxj since, by constraint (|35|) . x®j < 1 for all the edges (i,j)- We 
further modify the solution x 1 to x 2 by letting x| • = (1 — l/d)x\j for every edge in the graph 
G 1 . The objective value of the linear program (|33f) . (|3"5|) corresponding to x 2 is then at least 

^(CVM°(G) - M(d)w max ). We claim that in fact x 2 is a feasible solution to the linear program 
(f3"2|) - (|31)|) . implying (|51)|) and completing the proof of the proposition. 

Clearly, constraints (|33|) and (|35|) still hold. We concentrate on (|33|), Consider any set V C [n] and 
^4 C <5(V) such that b\V\ + |A| is odd. Assume first \V\ + \A\ < d. Let V denote the union of V and 
the end points of edges in A. Then \V\ < d. Since G 1 does not contain any cycles of size < d, then V 
does not contain any cycles at all, and therefore is a forrest. In particular it is a bipartite graph. Let 
x 1 denote the sub-vector of the vector x 1 corresponding to edges with both ends V. Since (|33|) holds 
for x , then it also holds for the vector x 1 for all nodes in V. A classical result from a combinatorial 
optimization theory Sch03 j states that for every bipartite graph, the polytop corresponding to the 
degree constraints (|33|) has only integral extreme points, which are 6-matchings (in the reduced graph 
V). This follows from the fact that this polytop when described in matrix form Bx < b corresponds to 
the case when B is totally unimodular. We refer the reader to |Sch03| for the details. But since any 
integral solution x 1 corresponding to the 6-matching must satisfy the constraints (|34|) by Theorem 32.2 
in Sch03 , then these constraints are automatically satisfied by x 1 . Moreover then their are satisfied by 
x 2 . We proved that (|34f) holds whenever |V| + \A\ < d. 

Suppose now |V| + \A\ > d. For the solution x 1 , let us sum the constraints (|33|) corresponding 
to all the nodes in V and sum the right-hand side constraints ()35|) corresponding to all the edges in 
A. Each value X{j for i,j £ V is counted twice, once for node % and once for node j. Each value 
Xij for S A is also counted twice, once for constraint l)33|) for the node i and once for constraint 
(jHSJ for the edge Then we obtain 2(X^ - e y xjj + Ylu,j)eA x \.j) — ^1^1 + 1^1- ^ n ^ s i m phes 

E t , 1 eV x lj+E( l , l)( ,A x l j < (l~l/d) b -^p^ < 6|V1+ j Ahl , since by assumption, b\V\ + \A\ > \V\ + \A\ > d. 
Again we showed that the constraint (|33|) holds for the solution x 2 . □ 

We return to our main setting - sparse random graph G(n, cn), with edges equipped with randomly 
generated weights Wij, drawn according to a distribution function w e ^ e (t),t ^ with support [0,-Bj. 
Let E = E(G(n,cn)) denote the edge set of this graph. We denote the value of CVAi°(G(n, cn)) by 
CVM°(n, c) for simplicity. That is, £VA4°(n, c) the optimal (random) value of the linear program Q32[). 
(|3"3"|) . (jSS)) on the graph G(n,cn). 

Proposition 8 W.h.p. as n — > oo 

(37) CVM°(n, c) - o{n) < M w (n, c, b) < CVM°{n, c). 

Proof : Let d(n) be a very slowly growing function of n. It is well known that in G(n, cn) w.h.p. 
the total number of edges which belong to at least one cycle with size < d(n) is o(n) (far more accurate 
bounds can be obtained JLROO ). Thus M(d(n)) = o(n) w.h.p. Note also that from (J5HJ, we have 
CVM°(n, c) < w max bn = 0(n). Applying Proposition [7| with d = d(n), we obtain the result. □ 
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Our final goal is proving the convergence w.h.p. of CVM°(n,c)/n. We use linear programming 
duality for this purpose. Consider the dual of the linear program (|3*2")l . (|3l5|) . (|35[). generated on the 
weighted graph G(n, cn). It involves variables yi, ■ ■ ■ ,y n and has the following form. 

(38) Minimize b yi + 

l<i<n (i,j)&E 

subject to : 

(39) yi + Vj + iPij > Wij, V (t, j) € E 

(40) Ui,1>ij>0- 

The objective value of this linear program is also CVM°(n, c), thanks to the strong duality of linear 
programming. The linear program above is almost of the form © that we need in order to apply 
Theorem Q Let us rewrite the linear programm above in the following equivalent form 

(41) Minimize b yi + tpi,j 

l<i<n (i,j)€E 

subject to : 

(42) {-l) yi + (-l) yj < -W itj + ^j, V € E 

(43) Vi,il>ij>0, 

Let the set of constraints C of the linear program (JBJ) contain only one element (— \)y\ + (—1)2/2 ^ 0. 
We set = 0, i?^ = where, as we recall, [0, B w ] is the support of the distribution of Wij. We 
also set w x = b and = 1. Our linear program has now the form © except for we need to consider 
in addition the constraints yi < B w . We claim that in fact these constraints are redundant. In fact 
any value of yi which exceeds B w > Wij can be decreased to B w , resulting in a smaller value of the 
objective function and still honoring all the constraints. Thus, we may replace < yi < B w simply 
by yi > 0. We conclude that the linear program (|41j). (|42|) . I|43|l has form @. Applying Theorem^ 
there exists a function g(c) > such that w.h.p. CVA4°(n,c)/n — > g(c) as n — > oo. Finally, applying 
Proposition |HJ we obtain (|11|) . This concludes the proof of Theorem |3 □ 

9 Discussion 

The results of the present paper lead to several interesting open questions. In addition to Conjecture 
@J stated in Section |31 it seems that the following analogue of Conjecture El is reasonable. 

Conjecture 5 Consider random K-LSAT problem (linear program ^jj) with n variables and m con- 
straints, where all ip variables are set to zero. Let N(n, m) denote the maximum cardinality subset of 
constraints Cj which is feasible. For any c > 0, the limit lim n ^ 00 (N (n, cn) /n) exists. Moreover, this 
limit is strictly smaller than unity, for all c > c* K . 

An interesting group of questions relates to the behavior of the function /(c) = lim n E,[QCV(n)]/n, 
which, by results of this paper is equal to zero for c < c* , in the specific context of linear program Q. 
What can be said about /(c) near c* K , when c* K < oo? It is not hard to show that /(c) is continuous 
and non-decreasing function of c. Is it differentiable in c* K ? Is it convex, concave or neither? Similar 
questions arise in connection with percolation probability 9(p) which is a function of a bond or site 
probability p. It is known that for every dimension d there exists a critical value p* d , for which the 
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probability 6{p) of existence of an infinite component containing the origin in d-dimensional bond/site 
percolation model, is equal to zero for p < p* d and is positive for p > p* d , (whether 8{p) = for p = p* d 
is a major outstanding problem in percolation theory), see Gri99 . The behavior of 9{p) near p* d is a 
well-known open problem. We also refer the reader to |APbj for the related discussion on scaling limits 
and universality of random combinatorial problems. 

Computing the values of c* K is another question which seems so far beyond the techniques applied 
in this paper. The cases where local weak convergence methods lead to computation of limiting values 
seem to be related to special structures of the corresponding problems. For example a very clever 
recursion was used by Aldous to prove Q{2) = 7r 2 /6 limit for the expected value of the minimum weight 
assignment, [AldOlj . See also Bandyopadhyay Ban02], who investigates some properties of the limiting 
measure arising in the proof of £(2) result. 

Similarly, a special structure was used in AS03 to prove £(3) limit for minimum spanning tree 
on a complete graph with exponentially distributed weights. (This problem was originally solved by 
Frieze |Fri85| . using combinatorial methods). Uncovering such special structure for our problem for the 
purposes of computing c* K is an interesting question. 

A separate course of investigation is to formulate and study randomly generated integer programming 
problems as a unifying framework for studying random k-SAT, coloring, maximum k-cuts , maximum 
independent sets, and other problems. For example, under which conditions on the set of prototype 
constraints does the feasibility problem experience a sharp transition? These conditions should be 
generic enough to include the problems mentioned above. Also we suspect that other results from 
polydedral combinatorics can be of use when studying these problems within an integer programming 
framework. 

Finally, it should be clear what goes wrong when one tries to use local weak convergence approach for 
random K-SAT problem, for example along the lines of Theorem^ Our approach is built on using the 
values like E[X^| •] to construct a feasible solution, but these expectations are not necessarily integers. 
Digging somewhat deeper into the issue, it seems that local weak convergence method in general is not 
very hopeful for resolving Conjecture^ since it looks only into constant size neighborhoods of nodes. To 
elaborate somewhat this point consider maximal independent set problem in r-regular random graphs, 
discussed in ASQ3| . For almost any node in such a graph, its constant size neighborhood is a r-regular 
tree, and, as such, the neighborhoods are indistinguishable. In such circumstances it seems hard to 
try to concoct a solution which is based only on neighborhoods of nodes. Some long-range structural 
properties of these graphs like structure of cycles have to be considered. We refer the reader to p\S03 
for a further discussion of this issue. 
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